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Abstract.  We  review  a  number  of  results  relating  the  propagation  of  singular¬ 
ities  for  hyperbolic  partial  differential  equations  —  i.e.  the  persistence,  or  non¬ 
localization,  of  wave  motion  —  with  well-posedness  for  some  inverse  problems  of 
reflection  type,  such  as  arise  for  instance  in  seismology  and  ultrasonics.  By  far  the 
most  complete  information  is  available  for  layered  problems.  We  show  how  a  simple 
but  refined  propagation-of-singularities  theorem,  with  estimates,  yields  important 
functional  properties  of  the  model-data  relationship  for  such  problems,  including 
regularity  in  various  useful  coefficient  classes,  separation  of  scales,  ....  We  explain 
the  essential  role  of  travel  time  in  the  study  of  these  problems,  and  show  how  its 
function  may  be  generalized  to  multidimensional  (i.e.  non-layered)  problems. 

1.  Introduction.  The  “inverse  problems  in  wave  propagation”  of  the  title  are 
idealizations  of  remote  sensing  techniques  such  as  reflection  seismology,  ground- 
penetrating  radar,  and  pulse-echo  ultrasonic  nondestructive  evaluation.  A  wave 
of  some  sort  is  stimulated  by  an  artificial  energy  source.  It  propagates  into  a 
region  containing  the  remote  structure  of  interest;  the  changes  in  physical  properties 
through  the  structure  cause  some  of  the  energy  of  the  wave  to  be  diverted  into  a 
reflected  wave  or  echo,  which  travels  in  roughly  the  opposite  direction,  back  to  a 
region  where  the  experimentalist  has  installed  appropriate  sensors  and  recording 
devices.  The  records  of  reflected  waves  are  to  be  analysed  to  reveal  whatever  can 
be  inferred  about  the  physical  properties  of  the  remote  structure. 

Two  conditions  must  be  fulfilled  for  this  scheme  to  work: 

1.  Energy,  propagating  in  the  form  of  waves,  must  penetrate  the  region  of  the 
remote  structure; 
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2.  A  reflection  or  echo  process  must  take  place,  partitioning  the  energy  of  the 
wavefield. 

The  requirement  that  energy  reach  the  target  area  in  the  form  of  waves  (and 
reciprocally,  return  from  the  target  area  as  waves)  rather  than  as  a  diffusion  process, 
say,  is  motivated  by  a  need  for  spatial  resolution  in  applications.  Evidently  the 
resolution  scale  for  a  wave  process  might  be  measured  in  wavelengths,  whereas 
the  resolution  possible  with  measurements  of  diffusing  fields  is  much  harder  to 
understand. 

Roughly  speaking,  wave  processes  are  modelled  by  hyperbolic  systems  of  partial 
differential  equations.  The  association  is  justified  by  the  geometrical  optics  analysis 
of  high-frequency  solutions,  which  has  its  modern  expression  in  various  propaga¬ 
tion  of  singularities  results.  These  theorems  give  precise  statements  to  the  effect 
that  a  disturbance  which  begins  coherently  along  a  wavefront  will  continue  in  that 
form,  the  wavefront  moving  in  such  a  way  that  its  orthogonal  trajectories  (in  the 
simplest  cases)  obey  a  system  of  ordinary  differential  equations.  The  description 
of  the  disturbance  thus  obtained  corresponds  well  to  the  physical  notion  of  wave, 
and  appears  to  be  correct  when  the  coefficients  of  the  hyperbolic  system,  which 
parameterize  local  physics  of  the  propagating  medium,  are  smooth  on  the  scale  of 
the  wavelength  of  the  disturbance. 

In  particular,  when  these  coefficients  are  smooth  on  a  scale  much  longer  than 
a  wavelength,  the  geometric  optics  approximation  predicts  no  reflected  waves  at 
all.  This  prediction  is  borne  out  quite  well  by  numerical  experiments.  Accord¬ 
ingly,  models  of  reflecting  media  must  involve  coefficients  rough  on  the  scale  of  a 
wavelength,  or  finer.  Morlet  (1982)  has  made  an  extensive  numerical  study  of  the 
reflection  and  transmission  of  plane  waves  in  plane  layered  acoustic  media  (these 
will  be  described  below)  —  that  is,  the  coefficients  of  the  acoustic  wave  equation 
are  functions  of  one  variable  only.  Such  plane-layered  models  are  widely  used  in 
seismology,  as  for  various  reasons  the  mechanical  properties  of  rocks  often  vary 
mostly  in  the  vertical  direction.  Morlet  found  that  once  the  length  scale  of  strong 
variation  in  coefficients  decreased  to  the  level  of  a  wavelength  and  the  propagation 
distance  remained  constant,  the  wave  nature  of  the  propagating  disturbance  was 
strongly  modified,  and  below  some  scale  disappeared  altogether:  the  wave  did  not 
propagate  through  such  strongly  oscillatory  media  at  all.  Since  no  “energy”  pen¬ 
etrated  the  highly  oscillatory  region,  there  were  no  reflected  waves  out  of  which 
to  infer  its  structure.  That  is,  the  wave  was  localized  in  the  complement  of  the 
oscillatory  region. 

Thus  models  of  media  which  are  too  smooth  on  the  wavelength  scale  do  not 
generate  reflected  waves  because  all  of  the  “energy”  in  the  wavefield  passes  through 
unaltered.  Models  which  are  too  rough ,  on  the  other  hand,  generate  no  reflected 
waves  because  no  “energy”  penetrates  to  be  reflected.  (“Energy”  here  merely  means 
some  convenient  measure  of  the  local  size  of  the  field,  which  may  or  may  not  be 
related  to  a  physical  energy.)  Thus  the  questions  are  naturally  posed: 

1.  Do  there  exist  models  of  a  degree  of  roughness/smoothness  just  right  to  per¬ 
mit  wave  propagation  and  generate  sufficiently  informative  reflected  waves? 

2.  Do  such  models,  if  they  exist,  model  real  physical  systems? 


The  first  question  is  mathematical;  partial  answers  will  be  sketched  in  the  following 
pages.  The  second  question  does  not  have  definitive  answers,  so  far  as  the  author 
is  aware,  though  the  motivation  for  question  (1)  is  the  appearance  of  experimental 
data  from  a  variety  of  subjects.  Clearer  answers  to  question  (2)  await  a  combination 
of  improvements  in  theory,  numerical  algorithms,  and  experimental  technique. 

While  the  generalities  should  be  similar  for  the  various  physical  models  support¬ 
ing  wave  propagation  (electromagnetism,  linear  elastodynamics,  . . . ),  the  details 
are  best  understood  for  linear  acoustics,  governed  by  the  linearized  pressure  equa¬ 
tion 

1  d2u  _  1 

— r-x-y  -  V  • -Vu  =  F  in  1R",  n  =  1,2,3,... 
pc 2  at2  p 

where  u(x,t )  is  the  (scalar,  infinitesimal  excess)  pressure  field  in  a  fluid,  p(x)  is  the 
density  in  the  equilibrium  state,  c(x)  is  the  sound  velocity,  and  the  source  term  F 
is  the  divergence  of  a  body  force.  We  shall  consider  both 

(i)  F  vanishes,  and  u  has  the  form  of  a  plane  wave 

u(x,t)  ~U{t  —  6  •  x) 

for  large  negative  t  —  sensible  when  p  and  c  are  constant  in  a  half-space, 
say  (§§2-4); 

(ii)  u  vanishes  identically  for  large  negative  t ,  and  F  has  point  support,  i.e. 

F(x,t)  =  f(t)6(x  —  xs)  (§5) 

The  data  of  an  inverse  problem  of  the  type  considered  here  is  the  trace  of  the 
pressure  field  u  on  a  time-like  hypersurface  —  we  shall  consider  only  a  coordinate 
hyperplane: 

S[p,c;F]  =  u\xn=0  . 

Of  course,  in  reality,  measured  signals  are  sampled ,  both  spatially  and  temporally, 
and  also  filtered  by  the  recording  apparatus.  In  this  paper  we  ignore  such  refine¬ 
ments. 

Since  the  local  “physics”  of  this  model  are  determined  entirely  by  p  and  c,  the 
object  of  the  inverse  problem  is  to  determine  (to  the  extent  possible)  p(x)  and  c(x) 
for  xn  /  0  from  S[p,c]F],  (F  will  be  regarded  as  known,  though  in  fact  it  should 
be  included  in  the  unknowns  for  many  models). 

By  far  the  best  results  are  available  for  the  plane- wave  problem  for  layered  me¬ 
dia,  i.e.  p  =  p(xn),  c  =  c(xn).  In  Section  2  we  state  a  simple  but  quite  precise 
propagation-of-singularities  result  for  such  problems,  and  exhibit  the  plethora  of 
consequences  which  follow  from  it:  regularity  properties  of  the  map  S ,  rough  cor¬ 
respondence  between  scales  in  u  and  scales  in  p ,  c,  etc.  We  describe  key  results  in 
the  simplest  form,  and  refer  the  reader  to  the  literature  for  the  most  general  ver¬ 
sions.  In  §3  we  show  how  these  consequences  of  a  refined  propagation-of-singularity 
result  can  be  used  to  investigate  the  plane-wave-inverse  problem.  The  end  result 
is  a  well-posedness  result  for  “mildly  rough”  models:  thus  we  are  able  to  give  an 
affirmative  answer  to  question  (1)  above  in  this  special  instance.  A  very  important 


role  is  played  in  this  development  by  the  reparameterization  of  the  layered  model 
by  travel-time:  without  this  device,  essentially  nothing  can  be  accomplished,  either 
theoretically  or  numerically.  In  Section  4,  we  give  a  very  brief  treatment  of  rougher 
media,  for  which  a  dichotomy  of  scales  exists,  so  that  homogenization  estimates 
can  be  used  to  reduce  the  problem  to  the  mildly  rough  case. 

The  plane-wave/layered  medium  problems,  while  forming  an  excellent  arena  for 
development  of  insight,  offer  limited  opportunity  for  immediate  practical  applica¬ 
tion.  We  consider  the  general  multidimensional  “point-source”  problem  (case  (ii)) 
in  §5.  Very  little  is  known  about  this  class  of  problems,  relative  to  the  layered  case. 
We  address  two  issues: 


(1)  proper  definition  of  the  map  S  on  non-smooth  coefficient  classes; 

(2)  location  of  a  substitute  for  the  travel-time  transformation. 

For  both  issues,  we  can  offer  only  fragmentary  information.  We  are  able,  how¬ 
ever,  to  explain  the  function  of  the  travel-time  transformation  for  the  plane-layered 
problems  in  such  a  way  that  its  functional  substitute  for  the  non-layered  problem  is 
(essentially)  uniquely  specified.  We  give  a  construction  of  this  “multidimensional 
travel-time  transformation”  under  very  restrictive  smoothness  assumptions,  which 
(despite  the  restrictions)  indicates  that  such  things  are  actually  possible  —  this  is 
the  only  original  result  of  the  paper. 

Most  of  the  results  and  insights  reported  here  are  the  result  of  joint  work  with 
many  colleagues  insofar  as  they  are  due  to  the  author  at  all.  Especially  Fadil 
Santosa  and  Paul  Sacks  deserve  much  credit  for  whatever  is  worthwhile  in  the 
following  pages.  Special  thanks  are  also  due  to  Rakesh,  R.M.  Lewis,  Cheryl  Percell, 
and  Gang  Bao. 

2.  Some  Simple  Propagation-of-Singularities  Results  in  Layered  Media,  and 
Consequences.  When  the  density  p  and  sound  velocity  c  are  smooth  and  functions 
of  only  one  space  coordinate,  say  z,  then  the  acoustic  wave  equation 

(  1  d2  „  1  ' \ 

\Pc2dt 2  v‘pV)“~°  (2-1) 

has  plane-wave  solutions ,  i.e.  the  pressure  u(x,z,t )  is  in  the  form 


u(x,z,t)  =  U(z,t  —  £  •  x) 


Such  solutions  exist  for  £  G  IR"  so  long  as  c(z)|£|  <  1  for  all  z.  Evidently  U  is  a 
solution  of 


1 

P 


d_ld_ 

dz  p  dz 


U  =  0  . 


(2.2) 


Write  v  —  (c-2  —  |£|2)-^  and  suppress  £  from  the  discussion  for  the  remainder  of 
this  section.  If  v  =  u0  =  const,  for  z  <  0,  and  a0  G  IR,  then  there  exists  a  unique 
distribution  solution  to  (2.2)  with 


U(z,t)  =  a0H(t  -  zjv 0) 


for  t,z  <  0.  Standard  constructions,  e.g.  Courant  and  Hilbert  (1962,  Ch.  V)  imply 
that  U  has  the  progressing  wave  expansion 


U(z,t )  =  a(z,t)H(t  —  r(z)) 


where 


is  the  travel  time. 


To  pursue  the  agenda  outlined  in  §1,  one  needs  explicit  estimates  of  a  in  terms 
of  norms  of  p,  c.  Also,  it  will  be  advantageous  to  consider  the  slightly  more  general 
problem 


1  d2U 
pv2  dt2 


d_  /I  dU\ 
dz  \p  dz  ) 
U 


=  ipo(z)8(t  -  t(z ))  +  ipi(z)w(z,  t)H(t  -  t(z)) 
=  0,  t  «  0 


with  o,  "0i  and  w  smooth.  A  simple  calculation  shows  that  we  should  take  U  = 
aH(t  —  r)  with  a  the  solution  of  the  characteristic  Cauchy  problem 


1  d2a  d  (l  da\ 
pv 2  dt 2  dz  \pdz ) 


—a(z,T(z))  -  a(z)a(z) 


ipiwH(t  —  r )  for  t  >  t(z )  ; 
p(z)v(z)il>0(z),  z  >  0 


a(0,0)  =  g0  (2.3) 

where  a(z)  =  ^(log pv)(z)).  Following  arguments  in  Symes  (1986a;  Section  1),  we 
obtain  estimates  for  the  “sideways  energy  form” 


of  the  form: 


dU 

dz 


\\Qt\\l°°[0,Z(Y)]  < 

C  (|I?/,o||l2[01Z(T)]  +  aolMlL2[0,Z(X)] 

f  r2T-r(z)  "j  \ 

+  \\xl’i\\h[o,z(T)]  SUP  W/x  (2.4) 

0 <z<Z(T)  (.“'t(z)  ]  J 

for  suitable  C  >  0  depending  only  on  \\p\\m[o,z{T)],  ||y||Hi[o,z(X)]-  Here 

Z(T)  —  inf  {z  :  t(z)  >  T}  . 

Since  we  also  have  control  of  the  characteristic  boundary  condition  a(z,r(z)) 
through  (2.3),  an  identical  bound  follows  for  the  Hl  norm  of  U  in  L(T )  =  {( z,t )  : 
0  <  z  <  Z(T ),  t(z )  <  t  <  2 T  —  t(z).  On  the  other  hand,  (2.4)  is  stronger  than 
such  a  statement,  and  shows  that  the  traces  of  first  derivatives  of  U  on  the  vertical 
line  segments  {z  =  const.}  are  well  defined. 


Thus  (2.4)  is  a  rather  strong  propagation-of-regularity  result  —  probably  the 
strongest  that  can  be  expected  to  hold  uniformly  over  p,  v  in  C°°n  a  bounded  set 
in  H\OC‘  ^  immediately  allows  us  to  extend  the  definition  of  various  maps  beyond 
the  class  of  smooth  coefficients.  For  example,  suppose  that  a0  =  1,  pl5p2  both  lie 
in  the  same  bounded  set  in  H\oc  H  C00,  and  v  =  1.  The  difference  V  =  U2  —  U\ 
satisfies 

d2v  d2v  ,  dv  ,  xdu2 

dz 2  dz2+aidz~^2  ai)  dz 

where  crt  -  £  logpt-,  i  =  1,2.  From  (2.4)  with  ip0  =  fa  =  w  =  0  and  a0  =  1,  we  get 
a  bound 


f2T-r(z) 

/  dt 

dU2 

Jt(z) 

dz 

< 

< 


Q(Z )  <  C\\<72\\l2[o,Z(T)] 

C\\p2\\m[0,Z(T)} 


so  we  can  use  (2.4)  again  with  0O  =  0,  -  au  w  =  dU2/dz ,  and  a0  =  0  to 

get 


L 


2T-r(z) 

w 


dt 


dV 


dt 


+ 


dV 


dx 


<  C\\cr2  -  c ! 


whence  a  Lipschitz  estimate  follows  for  the  map 


|2 

Il2[o,z(t)] 


:  logp  €  HX%Z{T))  ~  €  T2[0, 2T] 

for  example,  uniform  over  bounded  sets  in  i/1[0,Z(T)]  fl  C'00(IR).  Consequently 
the  map  extends  —  in  some  sense,  we  have  “solved”  this  singular  problem  with 
non-smooth  (H1)  coefficients. 

In  Symes  (1986a)  these  arguments  are  developed  further  to  show  that  Sg  is 
actually  a  C1  diffeomorphism  and  further  work  along  the  same  lines  shows  that  S$ 
is  C2.  It  is  probably  C°°.  Since  Sg  is  differentiable,  estimates  for  the  derivative 
are  useful.  It  is  possible  to  show,  by  very  similar  arguments,  that  the  formal 
linearization 


SIT--—-  — 
\pu2  dt2  dz  p  dz)  dz  p  dz 

actually  defines  the  derivative 


su  =  0  ,  t  <<  0 


DSg[p\6p=^f(  0,-). 

Moreover, 


C*\\&p\\h'[o,z(t)]  <  \\DSg[p}8p\\L2[Q'Z(T)]  <  C*\\8p\\m[0'Z{:r)] 
where  the  constants  0  <  C,  <  C*  are  uniform  as  log  p  ranges  over  bounded  sets  in 

H\oc  Similarly> 


DSn[p\6p  =  *U{  0,-) 


satisfies 


\\DS'h[p\8p\\l?[o,2T]  <  Ch\\8p\\l^[o,z(t)} 

with  another  ifj^-uniform  constant  Ch,  though  no  lower  estimate  seems  to  obtain 
for  DSh. 

These  estimates  show  that  there  is  approximate  separation  of  scales  in  the  rela¬ 
tion  between  coefficients  and  solution,  in  the  following  sense.  A  convolution  kernel 
/  €  £'  is  elliptic  on  H^oc  if  there  exist  constants  K0,  Ki,K*  so  that  for  u  G  H1( IR), 

tfilMli  _  ^o||u||0  <  ||/  *  u ||i  <  A'*||u||i  . 

For  example,  if  |/(u>)|  >  p  for  |u;|  >  fi*,  then  one  could  take  crudely  K\  =  p,  K0  = 
(1  +  £ll)?p  (much  more  precise  statements  are  possible).  Thus  the  relation  between 
the  constants  K0,  K\  measures  the  threshold  of  the  “passband”  of  /.  Such  /  may  be 
manufactured  by  subtracting  from  the  Dirac  8  function  a  smooth  approximation. 
Without  loss  of  generality  we  may  assume  that  /  is  “causal”,  i.e.  supp  /  C  [0,  oo). 
As  a  consequence  of  the  above  estimates  for  DSg ,  the  derivative  of 

Sf  =  f*Ss 

satisfies 


\\DSf[p]8p\\  >  I<\\\8 p\\Hi[Q<z(T))  -  A'o||^||l2[o,z(t)+z/] 

-  Kmmi(f)\\6p\\Hi[Z(T),z(T)+zf]  (2.5) 

where  K\ ,  K0  and  Km  depend  on  ||  l°g/>||/f>[o,z(r)],  the  constants  K*,K\  and  K0  in 
the  ellipticity  estimate  of  /,  and  Km  and  Zf  depend  also  on  the  first  moment 


mi{f)  =  inf  sup 

<o€K0€C5°(R) 


(/,  (t  - 10)4) 


of  /. 

The  first  two  terms  in  the  preceding  estimate  show  that  DSj  has  essentially  the 
same  “filter”  or  separation-of-scales  effect  as  does  /*,  at  least  qualitatively.  The 
last  term  is  necessary  because  the  “breadth”  of  /,  as  measured  by  its  first  moment 
m\ (/),  may  be  nonzero,  so  that  control  of  8Sf  is  needed  over  a  slightly  longer  time 
interval  —  or  equivalently  of  8 p  over  an  “extra”  depth  interval  —  to  ensure  control 
over  8p  on  [0,  Z(T)\.  Both  the  coefficient  and  the  size  Zf  of  the  extra 

depth  interval  are  0(mi(/)). 

In  order  to  deal  with  truly  “bandlimited”  kernels  —  i.e.  /  £  C^°  as  well 
—  higher-order  estimates  are  needed.  Suzuki  (1988)  showed  (essentially)  that 
Sg'  —  d/dtSg  obeys  an  estimate 


||-D5s/[p]<!>p||£2[o,2T]  <  C2\\8 p\\hi[o,z(t)] 

using  extensions  of  the  arguments  sketched  above;  the  constant  is  uniform  over 
bounded  sets  in  H^oc-  This  estimate  can  be  used  to  study  Sf  when  f*  obeys  an 
estimate  of  the  form 


Once  again,  this  estimate  can  be  interpreted  loosely  as  identifying  the  “passband” 
of  /.  If  |/(u>)|  >  p  >  0  for  fl*  <  |oj|  <  0*,  one  can  take  I{1  =  p ,  K0  =  p(  1  +  f 
as  before  and  K2  =  p{  1  +  ft*)- 2.  There  results  for  DSf  an  estimate  of  the  form 

\\DSf[p\6p\\LHOt2T]  >  K\\\Sp\\w[o,z(T)]  — Ko\\Sp\\l2[o,z(t)] 

~  ^2||^p||i/2[0,Z(T)] 

-  Kmm1(f)\\6p\\Hi[Z^T)tZ^T)+Zf] 

all  dependences  being  as  before,  except  for  I<2  which  is 

(a)  uniform  over  bounded  sets  in  Hfoc  for  fixed  /; 

(b)  for  p  in  a  fixed  bounded  set  in  ufoc’  = °(^y 

Thus  the  estimate  is  nontrivial  if  K2  is  small  enough,  i.e.  if  /  is  “close  enough  to 
elliptic”  relative  to  ||p||j/i[o,zmi. 

To  obtain  similar  results  for  the  map  tn->  1/,  or  (p,  v)  U,  one  must  impose 
much  more  smoothness  on  v,  essentially  because  a  change  in  v  actually  moves  the 
characteristics.  See  e.g.  Lewis’s  thesis  (1989)  or  Symes  (1986b).  The  diffeomor- 
phism  is  lost:  in  metrics  strong  enough  to  make  v\ U  differentiable,  the  derivative 
is  not  coercive. 

3.  Travel  Time  and  the  Determination  of  “Mildly  Rough”  Coefficients.  Maintain 
the  setting  of  the  previous  section,  but  now  consider,  for  simplicity,  the  case 
p  =  const.  (=  1)  instead.  Introduce  the  ( travel-time )  change-of-variable 

x  =  t(z) 

(NOTE:  we  are  using  “x”  in  a  different  sense  than  at  the  beginning  of  §2!)  Denoting 
by  (  the  inverse  function  of  r,  so  that 

/•C(D  1 

x  =  / 

Jo  V 

introduce 

u(x)  =  u(C(x)),  U(x,t)  =  U(((z),t)  . 

A  short  calculation  verifies  that 

U(x,t)  =  H(t  —  x),  x,t  <  0  ; 

d2U  d2U  _  dU 
dt 2  ”  5x2  _  a  ~frx  ~  ° 

where  Sr  =  ■£;  log  v.  Thus  v  plays  exactly  the  same  role  as  p  did  at  the  end  of  the 
last  section,  hence  the  map 

S,  :  v  €  ff'lO.T]  -»  ^(0,  )  €  L2[0,2T] 


is  well-defined  by  continuous  extension  from  v  G  C°° ,  is  a  C 2  diffeomorphism,  and 
all  the  “quasi-elliptic”  estimates  in  §2  apply  ipso-facto  to  Sf  —  f  *  Sg. 

Remark  In  §5  some  commentary  is  offered  on  the  “real”  significance  of  this  change 
of  variable. 

To  understand  how  results  about  Sj  might  yield  information  about  v  (rather 
than  v),  examine  the  change-of  variables  operator: 

M[v\  =  v  o  (  . 


It  is  possible  to  show  that 

(i)  M  extends  to  a  continuous  map  defined  on  the  positive  cone 
IT  =  {u  G  #foc[0,  oo)  :  u(0)  =  u0,  v(z)  >  0  Vz} 
and  taking  values  in  V,  for  k  >  1; 

(ii)  M  is  nowhere  locally  uniformly  continuous,  hence  a  fortiori  not  differ¬ 
entiable,  in  IV 

(hi)  Regarded  as  a  map:  V+2  -+  V,  M  is  differentiable. 

For  discussion  and  proofs  of  related  results  see  Lewis  (1989)  and  Symes  (1986b). 
The  continuity  result  is  due  to  Rakesh  (unpublished). 

Formally,  the  derivative  of  M  is  given  by 

6M(t)  = 


Thus  the  “principal  part”  of  6M  is  the  change- of- variables  for  u,  applied  to  8v  — 
the  simplest  sort  of  Fourier  integral  operator.  In  fact,  if  we  assume  that  v  =  const, 
for  large  Z,  then 

— M[v]  =  J  dk  a(t,k)et^k  (&) 

where  the  phase  £  itself  depends  on  v  —  i.e.  dM / dz  is  a  “nonlinear  Fourier  integral 
operator.” 

On  the  other  hand,  the  appearance  of  v'  in  the  second  term  above  hints  at  the 
nondifferentiability  of  M  : 

Now  suppose  that  /  is  an  elliptic  convolution  kernel,  as  in  the  previous  section. 
Appraisal  of  the  “Garding”  estimate  (2.5)  shows  that,  if  8v  is  sufficiently  smooth 
—  that  is,  ||<5v||L2  is  sufficiently  big  relative  to  ||^u||//i  —  the  estimate  has  no 
force.  If  8v  =  8M:  with  smooth  8v,  however,  it  may  be  that  5v  (rather  than  8v)  is 
controlled.  In  fact,  if  ||w'||//i  is  large,  one  might  suspect  that  the  second  term  in  the 
expression  for  SM  dominates  ||  /  ^||L2,  hence  (for  smooth  8v)  ||<HU2  itself.  This 
idea  is  embodied  in  the  “rough/smooth”  lemma  (see  Symes  (1988),  Lemma  3): 


Ha*)) + v’imsat) 


For  u,  $  €  C°°,  a  <b,  A  >  0,  set 


Then 


r*( A)  =  inf  r(x,A),  r*  =  sup  r(x,A) 

x<Eta>6]  i:G[a,6] 


ll^UllL2[a,6] 


> 


-^(r,(A)+r-(A))A2||<I>'||£,M] 


(3.7) 


Apply  this  estimate  to  the  second  term  in  SM :  with  u  = 


(|n2)  ,$  =  /•§,  we  get 


6v  +  vv 


'/$)•< 


H1  [0,T] 


>  Cxr*(  A) 


/ 


8v 


L*[0,Z(T)) 


^2||^w||h1[0  ,Z(T)] 


C3(r*(A)A!  +  1)  lIHllw)] 


where  Ci,  C2,  and  C3  are  uniform  for  v  in  bounded  sets  in  H j1  .  Thus  if  8v  is 
smooth  —  so  that  ||<5i;||tfi,  ||<5t>||£,2  and  ||/6t>||£,2  are  comparable  —  and  if  v"  is 
uniformly  big  relative  to  ||u||/yi,  locally  on  a  length  scale  of  A,  and  A  is  small 
enough,  —  then 

||^Af||//i[0)T]  >  C\\8v\\L2[0'Z(T)}  .  (3.8) 

Define  Sf  =  Sf  o  M.  Combining  (3.8)  with  (2.5)  we  see  that  if  in  addition 
IHU2  is  big  enough  relative  to  ||6u||ffi,  then 


I|£51|l2[0,2T]  >  C'jj5u||L2[o,z(j-)] 

(presuming  8v  =  0  in  {Z[T],  Z[T ]  +  Zf)  in  order  not  to  have  to  worry  about  the 
“margin  correction”).  That  is,  8S  is  coercive,  in  a  certain  sense,  for  very  smooth 
8v.  On  the  other  hand  it  is  plain  that  8S  is  coercive  for  oscillatory  8v. 

Unfortunately,  that’s  as  far  as  the  purely  1-D  problem  goes.  If  8v  is  neither 
purely  smooth  nor  purely  oscillatory,  it  is  possible  to  arrange  for  the  two  parts  of 
8M  to  very  nearly  cancel,  even  when  v  satisfies  the  “uniform  roughness”  condition 
sketched  above,  so  that  8S  cannot  possibly  be  coercive. 

The  way  out  is  to  remember  —  from  the  beginning  of  §2  —  that  each  choice  of 
sound  speed  profile  c(z )  (keeping  p  constant  still)  gives  a  suite  of  one- dimensional 
problems  with  plane-wave  velocities 

=  c(z)(l  -  c2(z)£2)-^  . 

The  change-of- variables  operator  M  and  the  seismogram  operators  Sf  thereby  ac¬ 
quire  dependence  on  £  as  well,  of  course.  The  perturbation  in  v  is  related  to  the 
perturbation  in  c  by 

8v  =  6c(l  —  c2£2)~2 


so 

SM  =  (1  —  c2£2)-t  Sc  -  cc'(  1  -  c^2)"*  f  — (1  -  c2£2H] 

L  c  \  z=t 

It  is  not  surprising  that,  while  8M  can  be  made  to  vanish  at  one  value  of  £,  it 
cannot  be  made  to  vanish  over  a  range  of  £  simultaneously.  In  fact,  a  coercivity 
estimate  is  possible  of  the  form 

1 1 SM  1 1 /f 1  ( [0,T]  X  [f  min ,£max])  >  C \\Sc\\L2[0<Z^T)] 

when  C  is  “rough  enough.” 

The  preceding  is  only  the  barest  sketch  of  an  argument,  of  course.  In  Symes 
(1988)  we  have  shown  that  all  of  the  choices  mentioned  above  can  actually  be  made. 
The  end  result  is  the  identification  of  a  class  £  C  //j1  of  velocity  profiles  c  and 
of  suitable  “elliptic”  kernels  /  which  permit  stable  solution  of  the  inverse  problem. 
More  precisely,  for  each  elliptic  kernel  /,  there  exists  a  neighborhood  Vj  of  5/(2) 
in  fqoc  and  a  locally  Lipschitz  map  If  :  Vf  — *  L\oc  satisfying 

Ij(Sf(c))  =  c  for  c£E. 

When  /  is  “bandlimited,”  i.e.  smooth  but  obeying  a  quasi-elliptic  estimate  like 
(2.6),  a  similar  result  is  obtained  via  regularization:  if  the  “upper  bandlimit”  pa¬ 
rameter  I<2  is  small  enough,  a  stable  inverse  map  If  exists  for  which  [If(Sf(c))  —  c] 
is  small  —  as  usual  with  regularization,  one  does  not  obtain  exact  recovery  of  the 
model.  The  degree  of  error  (for  consistent  data  in  S/(E))  can  be  made  as  small  as 
one  likes,  at  the  usual  price  of  degradation  of  stability. 

The  construction  of  I  depends  on  a  variational  problem  of  least-squares  type. 
This  variational  problem  has  served  as  the  basis  of  a  computer  program  to  solve 
the  inverse  problem  numerically,  which  has  yielded  quite  promising  results  when 
applied  to  both  synthetic  and  field  seismic  data.  For  details  see  Symes  (1988), 
Carazzone  and  Symes  (1989),  and  references  cited  there. 

4.  Very  Rough  Models  and  Homogenization.  The  sedimentary  crust,  amongst 
other  real  materials,  does  not  appear  to  possess  the  smoothness  relative  to  wave¬ 
lengths  and  other  scales  necessary  to  make  plausible  direct  application  of  the  results 
of  the  last  section.  Instead,  very  rough  models  may  be  approximated  in  a  weak  sense 
by  smooth  ones,  provided  that  a  dichotomy  of  length  scales  is  present. 

For  1-dimensional  wave  propagation,  a  version  of  such  an  approximation  scheme 
may  be  based  on  the  quantitative  homogenization  estimates  developed  by  Bam¬ 
berger  et  al.  (1979).  Specialized  to  the  present  context,  this  estimate  amounts 
to 

~  Sf[c2]\\L2[0,2T]  <  C||/||ff<(m)r(Ci,C2) 
for  log  ci,  log  c2  €  The  pseudo-distance  T  is  defined  by 

T(ci,c2)=  sup  f  dz'f -J-yr-  -ir) 

0  <z<Z  Jo  Vw)  C2\Z)J 


where  Z  is  chosen  sufficiently  large. 


If  E  is  the  set  of  models  described  in  the  previous  section,  Vf  the  neighborhood 
of  and  /  is  a  “quasi-elliptic”  but  smooth  kernel,  we  can  associate  to  the 

pair  (E,/)  the  set  S /  of  logc  £  f°r  which  5/[c]  €  V}.  It  is  evident  from  the 
definition  that  E /  contains  quite  a  bit  more  than  an  ^-neighborhood  of  E.  An 
easy  way  to  see  this  is  to  pick  a  Dirac  kernel  <j>  £  C£°( IR),  <f>(0)  ±  0,  ffi  =  1,  and  set 
for  e  >  0  <f>e{z)  =  e-1<^(e-1z).  Roughly,  e  is  the  “width”  of  <f>t,  and  the  convolution 
with  <f)t  averages  on  the  length  scale  e.  Then  for  log  c  £  L^c 

r(c,  (f)e  *  c-2)-2  =  O(e)  . 

Thus  c  £  £/  if  <j)t  *  c  £  E  for  e  small  enough  that  S/[c]  £  Vf.  Such  c  can  have 
arbitrarily  large  variation,  provided  that  it  is  on  a  sufficiently  fine  length  scale  that 
averaging  over  length  scale  e  =  0(||/||^ )  yields  an  element  of  E.  E  is  characterized 
also  by  being  a  subset  of  a  large  ball  in  H 2,  the  radius  being  determined  by  the 
“upper  bandlimit”  of  the  kernel  (i.e.  K2  in  2.6).  Thus  the  scales  permitted  in 
E  are  determined  by  this  upper  bandlimit.  All  significant  components  at  shorter 
scales  must  be  removed  (essentially)  by  averaging  on  the  length  scale  e.  Thus  the 
difference  c  —  (<f>c  *  c~2)~ 2  must  be  oscillatory  on  a  scale  smaller  than  e,  and  much 
smaller  than  that  permitted  by  membership  in  E. 

These  choices  of  scale  can  be  made  precise.  In  an  unpublished  technical  re¬ 
port,  the  author  made  most  of  these  choices  explicit  and  investigated  the  question 
numerically.  The  results  lent  credence  to  the  heuristic  rule: 

For  effective  recovery  of  the  layered  medium,  the  velocity  profile  must  consist 
of  a  “core”  medium  which  oscillates  on  a  wavelength  scale,  and  perhaps  a 
very  rough  component  oscillating  on  a  subwavelength  scale. 

Such  a  clean  frequency  dichotomy  is  probably  not  seen  in  nature;  in  fact  some 
evidence  exists  that  actual  velocity  distributions  are  fractal.  Thus  oscillatory  com¬ 
ponents  exist  at  all  scales,  and  the  preceding  heuristic  condition  is  not  satisfied. 
It  is  not  clear  whether  the  “mildly  rough”  results  bring  with  them  results  about 
media  without  the  frequency  dichotomy  described  here. 

Papanicolau,  White,  and  colleagues  have  studied  another  class  of  media  admit¬ 
ting  a  dichotomy  of  length  scales  (Burridge  et  al.  (1989)).  The  principal  difference 
in  problem  setting  between  our  work  and  theirs  is  in  the  relation  between  the  source 
frequency  content  and  the  length  scale  of  oscillation  in  the  coefficients.  Our  work 
pertains  to  a  fixed  source,  hence  fixed  bandwidth,  and  the  limit  as  the  length  scale 
for  the  coefficients  goes  to  zero.  White  et  al.  consider  instead  a  family  of  sources 
with  frequency  increasing  roughly  as  the  square-root  of  the  frequency  of  oscillation 
in  the  parameters  (after  normalization  to  fixed  propagation  time).  Thus  in  their 
work  the  oscillations  in  the  coefficient  continue  to  interact  with  the  oscillatory  en¬ 
ergy  in  the  solution,  which  does  not  approach  that  of  a  smooth-coefficient  equation. 
It  is  not  clear  which  asymptotic  regime  is  more  important  in  explaining  practical 
phenomena.  Most  likely  both  regimes  are  present  in  real  data. 

5.  Multidimensional  Problems.  We  return  to  the  problem  specified  at  the  be¬ 
ginning  of  Section  2,  viz.  linear  acoustics,  but  this  time  in  dimension  n  >  1  and 


with  the  coefficients  allowed  to  depend  on  all  the  space  variables.  We  introduce 
“energy”  into  the  system  via  a  “point  source,”  i.e.  a  right-hand  side  with  point 
support  in  x : 


(  i  d 2 

\p(a:)e2(:r)  dt2 


u(x,t)  =  f(t)S(x) 


u  =  0,  t  <  0  . 


Once  again  we  are  interested  in  estimates  which  hold  for  elliptic  /,  uniformly  over 
bounded  sets  of  coefficients  in  Sobolev  spaces  and  especially  in  estimates  for  time¬ 
like  traces. 

Very  little  is  known  about  these  problems,  and  the  following  discussion  is  meant 
mainly  to  illustrate  some  of  the  remaining  difficulties.  Some  partial  results  may 
be  found  in  Rakesh  (1988),  Sacks  and  Symes  (1985),  and  Sun  (1987).  Two  major 
differences  emerge  compared  to  the  layered  case.  The  first  of  these  is  that  con¬ 
siderably  more  coefficient  regularity  is  needed  to  assure  the  optimal  regularity  of 
timelike  traces.  To  fix  ideas,  take  /  =  above,  and  assume  that  logc  £  C°° 

is  fixed.  Near  x  =  t  =  0,  Hadamard’s  construction  (Courant  and  Hilbert,  Ch.  6) 
yields 

u(x,t)  =  a(x)p~*(x)H(t  -  t(x))  +  Ro(x,t) 

where  r(x)  is  the  geodesic  distance  from  x  to  0  in  the  metric  c~2Y^idx{dxi  ,  a  is 
a  C°°  transport  coefficient,  and  R0  vanishes  at  t  =  t(x).  This  expression  remains 
correct  so  long  as  we  encounter  no  point  conjugate  to  0,  i.e.  as  long  as  no  caustics 
appear  in  the  wave-front,  which  we  assume  (the  complication  in  this  analysis  arising 
from  caustics  have  not  been  understood,  nor  has  the  effect  of  changing  c  —  see 
some  discussion  below).  Thus  u  may  as  well  be  viewed  as  the  solution  of  an  interior 
characteristic  Cauchy  problem. 

It  is  natural  to  attempt  to  bound  the  energy  norm :  With 


Bt  =  {x  :  t(x)  <  T} 

Ct  =  {(<M)  :  t  —  t(x)  <  T} 

we  get  the  standard  energy  identity 


f  , 

'  1 

du 

i  |2 

[  1 

du 

/  ax 
>bt 

pc 2 

dt 

+  -  v« 2 

p 

=  - 

J  Ct  p 

ttVt  +  Vu 
dt 

The  transport  equation  (Hadamard’s  construction,  above)  yields  all  of  the  deriva¬ 
tives  of  u  tangential  to  Ct,  but  the  normal  derivative  is  also  needed  to  evaluate 
the  R.H.S.  of  the  energy  identity.  The  normal  derivative  —  or  equally  well,  the 
i-derivative  —  trace  on  Ct  is  determined  by  the  second  transport  equation, 

Vr  •  Vai  -f  (b  +  V  log  p  ■  Vr)ai  = 

(A  —  V  log  p  •  V)(ap-2 ) 
du 


Here  b  depends  on  the  bicharacteristic  geometry,  hence  on  c.  Thus  second  deriva¬ 
tives  of  p  are  involved,  and  will  not  be  “integrated  out,”  an  important  difference 
with  the  layered  case. 

A  certain  amount  of  evidence  suggests  that  the  proper  way  to  resolve  this  and 
other  problems  is  to  impose  microlocal  restrictions  on  the  coefficients.  Fundamen¬ 
tally,  all  of  this  evidence  rests  on  Rauch’s  Lemma  (Rauch,  1979),  which  we  state 
here  in  a  lopsided  form  suitable  for  application  to  linear  problems,  using  now- 
conventional  notation  (Beals  and  Reed,  1984): 

Suppose  that  for  some  (x0,{0)  G  T*IRn,  the  distributions  a,u  satisfy  a  € 
H°  n  Hrmt(x  o,£o),  ueffn  Hqm((x  o,6>)  with  0  <t<s,  q<t  +  r-s, 
q<s  +  t  —  n/ 2.  Then  a  u  e  Hf  f]  H^e(x 0,£o). 

That  is,  Hl  D  H^(x 0,£o)  is  an  ( Hs  fl  H^^xq,  £o))~algebra.  The  lemma  is  usually 
stated  for  t  =  s,  q  =  r,  as  is  appropriate  for  applications  to  nonlinear  problems. 
For  linear  problems  such  as  that  considered  here,  the  solution  may  well  have  a 
different  degree  of  smoothness  than  the  coefficients  —  in  fact,  the  solution  u  above 
is  expected  to  have  a  jump  discontinuity.  Such  strong  singularities  are  much  more 
difficult  to  treat  in  nonlinear  problems,  and  most  of  the  results  based  on  Rauch’s 
lemma  and  related  ideas  concern  much  weaker  singularities  —  see  e.g.  Beals  and 
Reed  (1982,  1984).  Nonetheless,  we  expect  to  be  able  to  understand  the  relation 
between  the  coefficient  and  solution  regularity  away  from  the  leading  singularity 
or  wavefront.  For  example,  a  quantified  version  of  Rauch’s  lemma  enables  one 
to  prove  microlocal  energy  estimates  for  first-order  hyperbolics  like  the  transport 
equation  above.  In  the  present  instance,  we  obtain  ax  €  H“  if  p  G  iP+2,  s  >  l+n/2 
(n  =  number  of  space  dimensions)  plus  microlocal  conditions.  Then  Theorem  2.1  in 
Beals  and  Reed  (1984)  implies  that  at  G  H^1  away  from  the  characteristic  variety 
of  the  transport  operator.  One  then  iterates  this  construction  to  produce  higher 
energy  estimates  for  u  interior  to  the  light  cone. 

Another  piece  of  evidence  in  favor  of  microlocal  coefficient  classes  is  the  following 
trace  theorem  due  to  Bao  (1989):  it  concerns  the  case  c  =  1. 

Suppose  s  >  3  +  n/2  and 

(i)  [gr  -  A  -  V<r  •  V]  u  =  /  in  IRn+1 

(ii)  u,f  =  0  for  t  «  0 

(Hi)  T  is  a  closed  conic  set  containing 
{(*,0:  Kn|<e|a  \x\<e} 
for  small  e  >  0  (here  £'  =  (&,...  £„_i ) ) 

(iv)  u,  Vcr,  and  f  G  LP-1  0  H^e(T) 

Then  for  G  C’^°(IRn+1), 

MXn= o  €  H°  . 

Thus  traces  are  as  regular  as  the  solutions,  and  particularly  as  regular  as  the 
coefficients  allow  —  but  now  at  a  higher  degree  of  smoothness  than  in  the  layered 
case,  and  only  with  microlocal  restrictions  against  tangential  oscillations. 


A  great  deal  remains  to  be  said  along  these  lines,  but  at  least  one  can  imagine 
that  the  problems  are  in  some  sense  “technical.”  The  very  delicate  dependence  of 
the  solution  on  the  principal  part  of  the  wave  operator  —  physically  on  the  veloc¬ 
ities  —  poses  a  quite  different  sort  of  difficulty,  which  forms  the  second  topic  of 
this  section.  For  layered  media  and  plane-wave  solutions,  the  travel-time  transfor¬ 
mation  was  available  to  regularize  this  dependence  —  by  making  the  coefficients 
of  the  principal  part  constant!  No  such  possibility  exists  for  n  >  1.  To  see  how 
the  essential  role  of  the  travel-time  transformation  may  nonetheless  be  filled,  we 
examine  once  more  the  one-dimensional  case. 

Recall  from  Section  3  that  we  could  write 

M[r]{t)  =  j  dka(t,k)ei(:[r](t)kr(k ) 

where  r  =  dv/dz  and  is  presumed  to  have  compact  support,  and  £[r]  is  the  corre¬ 
sponding  inverse  travel- time  function. 

Consider  more  generally  a  family  of  oscillatory  integral  operators  of  the  form 
G[u\<f>(x)  =  f  dy  (  dO  a(u ;  x ,  y, 

J  ]Rn  J  ]R 

where  the  symbol  a  and  phase  <f>  are  Gateaux-differentiable  functions  of  u  6 
Co°(IR),  to  begin  with,  and  assume  that  G  is  elliptic.  From  the  calculus  of  such 
operators,  it  follows  that  the  formal  perturbation 

£Cr[u;  6u\cj)(x)  =  J  dy  J  d6  (Sa(u,Su;  x,  9) 

+  ia(u ;  x,  e)8(f){u,  8u ;  x,  y,  6)) 
is  of  order  higher  than  that  of  G,  and  can  be  expressed  as 

8G[u,8u]  =  G[u]P[u,8u]  =  P[u,8u]G[u] 

where  P ',  P  are  elliptic  pseudo  differential  operators  of  order  >  0  with  symbols 
linearly  dependent  on  8u.  The  travel-time  transform  actually  has  the  form 

u  i— >  G[u]u  . 

Such  operators  can  be  said  to  “have”  a  canonical  relation,  even  though  they  are 
nonlinear  —  namely,  that  of  G[it].  When  u  changes  the  canonical  relation  changes, 
shifting  (infinitesimally:  differentiating)  the  singularities  of  the  image.  In  order 
that  8G  may  have  the  same  sort  of  bounds  as  G  —  in  terms  of  8u  and  of  u  —  it 
is  necessary  to  compose  G  with  another  (necessarily  equally  irregular)  transform, 
having  the  inverse  canonical  relation.  That  is,  we  need  to  find  H  and  Q  so  that 
Q  ~  G  o  H  has  a  derivative  8Q  with  the  same  qualitative  estimates  as  Q  itself.  It 
is  natural  to  look  for  H  also  in  the  form  of  an  oscillatory  integral,  and  to  expect  Q 
to  be  pseudo- differential  —  this  is  exactly  what  happens  in  the  case  of  travel-time. 
That  is,  we  seek  H[u ]  so  that 


Q[u]  =  G[H[u]u]H[u] 


is  pseudodifferential,  with  symbol  depending  smoothly  on  u. 

In  the  case  at  hand,  H[u ]  is  essentially  uniquely  specified  by  this  requirement, 
since  its  canonical  relation  is  specified.  To  construct  H  explicitly,  it  is  necessary 
to  explain  first  the  canonical  relation  associated  to  the  seismogram  operator.  For¬ 
tunately,  the  linearized  seismogram  operator  has  the  same  canonical  relation,  and 
this  latter  has  been  discussed  extensively  —  see  e.g.  Beylkin  (1985),  Rakesh  (1988), 
for  example.  The  matter  is  clearest  when  the  oscillatory  integral  G  is  expressed  as 
a  generalized  Radon  transform  (the  viewpoint  emphasized  by  Beylkin). 

For  simplicity  we  discuss  only  the  two-dimensional  case.  Let  R  =  {(aq,£2)  : 
— X  <  x\  <  X,  0  <  X2  <  Z}  and  suppose  that  C  is  a  class  of  C 1  metrics  of  the 
form  c~ 2  dxrfxi  in  R  for  which  the  geodesics  between  points  in  R  and  points  in 
the  surface  segment  [— XR,  Xr]  x  {0},  xR  <  X,  have  no  conjugate  points  and  lie 
entirely  in  R.  This  hypothesis  holds  for  c  =  const.,  hence  for  a  C2-neighborhood 
of  the  constants.  This  “no-caustics”  hypothesis  is  necessary  if  results  as  simple  as 
those  to  follow  are  to  hold,  as  is  shown  in  Percell’s  thesis  (1989). 

Denoting  by  r(x,y )  the  geodesic  distance  from  x  to  ?/,  the  reflection  phase  for 
V  €  R,  xre[-XR,XR]: 


<KC ;  xr;  y)  =  r(0,  y)  +  r(y,  (xr,  0)) 
is  well-defined  for  c  £  C.  For  /  €  C°°(R)  define 

G[c]f(xr,t)  =  j  f. 

Jy:<p(c-,xr,y)=t 

That  is,  G[c]f  is  the  generalized  Radon  transform  of  /  over  the  equal-time  curve 
{y  :  (j){c\  xT,  y)  =  t }  generated  by  C.  For  c  =  const.,  the  equal-time  curves  are  arcs 
of  ellipses  with  focii  at  (0,0),  (xr,0)  and  radii  proportional  to  t. 

For  smooth  c,  the  impulsive  seismogram  operator 

S[c]  =  u|X2=0 

where 

1  d2u  .  /./  . 

—  — ■  -  A u  =  S(t)S(x)  u  =  0,  t  <  0 
&  oV 

can  be  expressed  as 

S[c]  =  p[c]G[c]c 

for  a  suitable  pseudodifferential  p[c],  so  S  “has  the  same  canonical  relation  as  G .” 

The  class  of  operators  with  the  inverse  canonical  relation  to  that  of  G[c\  all 
differ  from  the  weighted  adjoint  operator 

fXR 

G*[c]v(y)  =  /  dxTb(xr,XR,y)v(xT,<f)(c;xr,y)) 

J-XR 

by  composition  with  the  pseudodifferential  operator. 

Note  that  G*[c\  maps  functions  v(xr,t)  of  “receiver”  and  “time”  to  functions  of 
y,  i.e.  space  —  in  exploration  seismic  parlance,  G*[c\  is  a  migration  operator.  The 
weight  b  has  been  included  for  generality  —  changing  it  will  change  the  result  by 
a  pseudodifferential  operator. 


Since  G*  [c]  depends  on  c,  it  is  not  exactly  right.  We  need  an  operator  defined 
in  terms  of  functions  c(a:r,f),  which  will  function  as  the  equivalent  of  the  travel¬ 
time  velocity  v  of  Section  3.  We  define  such  an  operator  in  terms  of  an  auxiliary 
problem,  as  follows. 

The  “trick”  is  to  construct  directly  the  reflection  phase.  Note  that 

<f>iX\xr,y)  =  \y\  +  \yi  -  xr,y2)\  =:  ®0(xr,y)  ■ 

We  will  need  to  control  first  derivatives  of  the  reflection  phase  —  but  $o  is  not  C 1 
near  y  =  0,  y  =  (27, 0).  So  we  define  instead  for  8  >  0,  k  =  1, 2, . . . 

$(®r,  y)  =  $o(xr,y)  +  A$i(xr,  y), 

G  Ck([-XR,XR ]  x  R)  and 
$i(zr,y)  =  0  if  y2  <  8  .} 

It  is  easy  to  see  that  if  c  G  C  satisfies  c(y)  =  1  for  y2  <  8 ,  then  •,  ■)  G  V}.  We 
give  Vg  the  Ck  topology  in  the  obvious  way. 

Pick  7  G  (^“(IR)  with  7 (y)  =  0  for  y2  <  6,  7 (y)  =  1  for  y2  >  28.  For 
f  G  C3([-Xfl,  XR]  x  IR+),  $  G  Vks  define 

fXR 

$,(f)  =  /  dxrr(xr,$(xry)) 

J-Xr 

n[$,r]  =  l  +  7$*[f] 

We  consider  the  fixed-point  problem 

:=  </>(u[$,f];-,-)  =  $  . 

If  $  =  $[r]  solves  this  fixed-point  problem,  then  the  operator  4>,  has  the  same 
canonical  relation  as  (j*[n[$, f]],  hence  the  inverse  canonical  relation  to 
or  S'[n[$,f]].  Hence 

r  1— ►  ,f>[t;[$[r], r]] 

is  “nonlinear  pseudodifferential”  and  has  some  hope  of  extending  to  a  differentiable 
map  with  coercive  derivative  on  interesting  classes. 

Note  that  if  ||f||c*+i  is  sufficiently  small,  and  $  =  $0  +  as  above  with 
||$i||ci  <  1,  then  v[<f>,r]  is  positive,  and  even  a  member  of  C  with  v[<j>,  r](y)  =  1  if 
y2  <  8.  Thus  Tf  maps  Vk  into  itself  for  ||f||c*+i  small  enough.  Also,  $*[0]  =  0,  so 
v[<j>,  0]  =  1  for  any  $  G  Vs-  Thus  T0$  =  $0,  and  therefore  T0  has  a  unique  fixed 
point,  namely  $o-  If  3>,  G  Vk 

$  —  $0  +  ^1  1  $/  =  $0  +  , 

then  it  is  easy  to  estimate 


so 


H$,f]  -v[$',f]||c*  <  V\\f ||C*+i||$i  -  $'xj|c* 
for  V  depending  on  6. 

Next  we  need  a  few  facts  about  the  mapping  from  a  velocity  c  to  its  travel-time 
function  r(x,y).  We  will  assume  that  x2  =  0  and  c(y )  =  1  for  y2  <  6.  Then  for 
\x  —  y\  <  6 ,  r(x,y)  =  \x  —  y |.  For  y  with  y2  >  6,  if  c  6  C  then  r(x,y)  is  a  C1 
solution  of  the  eikonal  equation 

c2|Vr|2  =  1  . 

If  moreover 

c  €  C°M  =  {c  €  C  :  ||  log  c||co(H)  <  M } 
then  we  obtain  the  obvious  estimate 


llTllc*(fl)  <  Cm 

for  a  suitable  constant  >  0. 

We  also  need  bounds  on  the  second  derivatives  of  r,  which  can  be  got  as  follows. 
The  Hamiltonian  system  associated  to  the  eikonal  equations  is 

.  o  .  .  ,o  N/  c 

y  =  c  rj  r)  =  —  cvc\r]\  — - . 

c 

Denote  by  (y(t,  yi),  £(t,  yi))  the  solution  starting  (at  t  =  0)  at 


Then 


(2/1,  <5, 


-  yi  x 

s  -  y|  ’  |®  -  yr  ’ 


’■(y^yi).*)  =  ‘r((yi,5),«)  + 1 


From  differentiation  of  the  eikonal  equation 


Vr  •  V 


5r 

5a:,- 


=  -2c"3 


5c 

dxi 


we  see  that  the  derivative  of  Jy-  along  the  rays  is  bounded  by  || c||  1 .  On  the  other 
hand, 


(*,yi) 


w-(c2(y{t,yi))i(t,yi)) 

oy  1 

2c(y(t,yi))  Vc(y(t,yi)) 
+  c2(y(f,yi))J^(t,yi) 


^-(Vlog  c(y(«,yi))) 

5yi 

dv 

VV  log  c(y(i,yx))  •  ^-(*,yi) 


y(^yi) 


whence  a  standard  O.D.E.  estimate  gives 


<  const.  ||logc||c2(H) 


when  y(t,yi )  £  R.  The  assumption  that  no  conjugate  points  exist  implies  that 
(f,t/i)  i — >  y(t,yi)  is  a  global  coordinate  system  in  R,  so  and  y  are  linearly 
independent.  Moreover,  it  is  possible  to  show  that  the  condition  number  of 


^(l-fUVlogcllcO-1 

(this  is  an  explicit  bound  guaranteeing  global  geodesic  coordinates).  Since 


drA- 

dyJ 


Wr.^)  =VVlog  c-p- 
dy  i )  dyx 


the  condition  number  estimate  shows  that 


HVVrlloo  <  C  [(1  -  t||VlogC||cl)_1||VlogC||cl  +  1  . 


Define 


e«p  =  {c€CVi|viogc||0,  <r}  • 

Then  if  P  is  small  enough,  there  is  a  constant  K2  >  0  so  that 

c  €  C2m  p  =>  || —  $o||c2  <  K2  . 
Next,  suppose  C2,c2  £  Cm,p-  Then 


Vn  •  -  Vr2  •  =  2  (c2-3^  - 

oxi  oxi  \  oxi 

Vn  •  V  +  (Vr,  -  Vr2)  •  V 


dxi  dxi 


-3  dc2  _3  dc\ 
2  dxi  dxi 


Integrating  along  the  rays  for  T\  and  using  T\  =  r2  at  y2  —  6  we  get 

||Tl  -  T2||Ci  <  ^llcl  -  ^"2  He1 

where  L  is  a  function  of  M  and  P  (and  in  particular  of  ||Vlogc,||ci),  whence 
ll<Hci;  •» •)  -  -7  -)llc7i  <  l\\c\  -  c2\\Ci  ■ 

Now  we  are  ready  to  string  the  various  facts  together.  Set 


B  =  V\  n  {||$  —  $o||c2  <  R2}  ■ 


For  $£6, 

||u[$,r]  -  1||C2  <  ||u[$,f]  -  u[$0,f]||c2  +  ||5[$0,r]  -  5[$o,0]||c2 

<  VK2\\r\\c3  +  \\^{f}\\C2 

<  {VK2  +  W)\\r\\c3 

where  W  is  an  upper  bound  for  the  length  of  the  curves  xr  (xr:  <h(y,  xr))  as  y 
ranges  over  R  and  $  over  B. 

Consequently  there  exists  Q  >  0  so  that  if  ||f  ||cs  <  Q,  then  $  £  B  implies 

u[$,  r]  £  Cm  P  . 

Q  depends  on  V ,  M,  and  P  (and  on  K2  and  W,  though  these  depend  on  M  and 
P.)  Consequently,  if  $  £  B,  ||r||3  <  Q,  then 

||^(u[$,r];.,.)-^o||c2  <K2, 

i.e.  Tf  maps  B  into  itself.  Next  note  that  for  <J>i,$2  £  B, 

<  Vi||r||C2||$a-$2||c>  • 

Thus  by  making  Q  possibly  smaller  yet,  we  can  arrange  that  Tf  is  a  contraction  on 
B  —  but  only  in  the  sense  of  C1,  not  C2.  So  we  don’t  get  a  solution  of  the  fixed 
point  problem  in  B  —  nonetheless,  the  sequence  of  iterates  {T"[$r]}  is 

(a)  bounded  in  C 2 

(b)  convergent  in  Cx  to  a  C1  solution  of  7f$  = 

This  result  is  only  a  glimpse  of  the  actual  state  of  affairs  concerning  the  “multi¬ 
dimensional  travel  time  transformation,”  which  the  author  expects  to  occupy  a 
central  position  in  the  study  of  wave  propagational  inverse  problems. 
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